Prosodic Analysis of a German Read Corpus for a CALL System for Rehabilitation Purposes
Authors
Abstract
Computer Assisted Language Learning (CALL) systems for rehabilitation purposes, which involve inter-related rhythmic and prosodic analysis, have gained considerable importance in speech synthesis and recognition. The overall aim is to develop a language learning system that monitors and evaluates the prosodic variations and content data arising in pronunciation training for German and Slavic languages. The CALL system AzAR (German acronym for 'automat for accent reduction'), developed at our institute laboratory, gives the learner feedback on pronunciation as well as on prosodic quality [1]. For learners and teachers, the most important new feature is the integration of a large speech corpus and multilingual speech databases [2]. The main objective of the present work is to demonstrate the significance of rhythm-based prosodic variations in fundamental frequency (f0), duration and intensity for a limited set of German phonemes, and to develop evaluation criteria for speech synthesis. The work comprises a rhythm-based prosodic analysis, an accenting and de-accenting experiment using resynthesis, and a perceptual test. Its core is the analysis of rhythmic factors related to prosody, which yields results for f0, duration and intensity and their significance. In the subjective evaluation test with resynthesis stimuli, phonemes with modified duration and mean frequency showed improved perceptual quality, a promising result for perception. Stress is treated as a prominent factor in this analysis because it is the basis of rhythm in all languages, and a rhythmic continuum that does not stretch from phoneme to stress timing has been proposed [5]. The speech database comprises 50 male and 60 female speakers aged 55 to 75 years. Each speaker read 105 German sentences. The data from 5 speakers were recorded in a laboratory studio at 16 kHz, 16-bit PCM.
Our results address the significance of stress and intonation, phoneme duration, intensity variation, the relation among duration, f0 and intensity, and perceptual evaluation. The main application is the development of a multimodal, data-based assistance system for self-learning activities for elderly people affected by brain-related diseases, e.g. Parkinson's disease. This work is part of the ongoing project "RehaVox", funded by the Federal Ministry of Economics and Technology, Germany.
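The three acoustic correlates named in the abstract (f0, duration and intensity) can be illustrated with a minimal sketch. It is not the authors' analysis pipeline: plain autocorrelation is swapped in as a generic pitch estimator, the function name `segment_prosody` and the f0 search bounds are hypothetical, and a synthetic tone stands in for a labelled phoneme segment. Only the 16 kHz sampling rate is taken from the abstract.

```python
import numpy as np

SAMPLE_RATE = 16000  # Hz, the recording rate stated in the abstract


def segment_prosody(samples, sample_rate=SAMPLE_RATE, f0_min=75.0, f0_max=400.0):
    """Return (duration_s, intensity_db, f0_hz) for one voiced segment.

    f0_min and f0_max are assumed search bounds, not values from the paper.
    """
    x = np.asarray(samples, dtype=np.float64)
    duration_s = len(x) / sample_rate

    # Intensity as RMS level in dB relative to a full-scale amplitude of 1.0.
    rms = np.sqrt(np.mean(x ** 2))
    intensity_db = 20.0 * np.log10(max(rms, 1e-12))

    # f0 from the strongest autocorrelation peak inside the allowed lag range.
    x = x - np.mean(x)
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]  # lags 0 .. len(x)-1
    lag_min = int(sample_rate / f0_max)
    lag_max = min(int(sample_rate / f0_min), len(ac) - 1)
    best_lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    f0_hz = sample_rate / best_lag
    return duration_s, intensity_db, f0_hz


# Synthetic 200 Hz tone, 0.1 s long, standing in for a phoneme segment.
t = np.arange(SAMPLE_RATE // 10) / SAMPLE_RATE
tone = 0.5 * np.sin(2 * np.pi * 200.0 * t)
duration_s, intensity_db, f0_hz = segment_prosody(tone)
```

On real recordings, each phoneme segment would first have to be cut out using a phonetic alignment, and a more robust pitch tracker would normally replace the bare autocorrelation peak.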
Related resources
Towards a Computer-Aided Pronunciation Training System for German Learners of Mandarin - Prosodic Analysis
This paper reports on the continued activities towards the development of a computer-aided language learning system for German learners of Mandarin. In this experiment we used a complex corpus which consists of whole sentences read by German students from three different years of language education and by native speakers of Mandarin. A contrastive analysis of prosodic features (rhythmic and ...
Design and Evaluation of a Speech Synthesis Model Using Concatenation of Prosodic-Context-Sensitive Units
This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. The syllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...
A rhythmic-prosodic model of poetic speech
In this paper a new approach towards the analysis of speech rhythm is presented. In the speech rhythm literature it was often discussed that rhythmic phenomena are more transparent in the metrical structure of orally produced poetry. However, up to now only a few phoneticians have worked on this special speaking style. For analyzing the rhythmic and prosodic patterns of this kind of speech, a c...
The effect of bilateral subthalamic nucleus deep brain stimulation (STN-DBS) on the acoustic and prosodic features in patients with Parkinson’s disease: A study protocol for the first trial on Iranian patients
Background: The effect of subthalamic nucleus deep brain stimulation (STN-DBS) on the voice features in Parkinson’s disease (PD) is controversial. No study has comprehensively evaluated the voice features of PD patients who underwent STN-DBS using acoustic, perceptual, and patient-based assessments. Furthermore, no study has investigated prosodic features before and after DBS in PD. The curren...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making man-machine interaction more natural. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Journal title:
Volume, issue:
Pages: -
Publication date: 2013